Search results for "Vanishing gradient problem"

showing 1 items of 1 documents

An In-Depth Experimental Comparison of RNTNs and CNNs for Sentence Modeling

2017

The goal of modeling sentences is to accurately represent their meaning for different tasks. A variety of deep learning architectures have been proposed to model sentences, however, little is known about their comparative performance on a common ground, across a variety of datasets, and on the same level of optimization. In this paper, we provide such a novel comparison for two popular architectures, Recursive Neural Tensor Networks (RNTNs) and Convolutional Neural Networks (CNNs). Although RNTNs have been shown to work well in many cases, they require intensive manual labeling due to the vanishing gradient problem. To enable an extensive comparison of the two architectures, this paper empl…

Structure (mathematical logic)Vanishing gradient problemPhrasebusiness.industryComputer scienceDeep learning05 social sciencesPattern recognition010501 environmental sciences01 natural sciencesConvolutional neural networkSet (abstract data type)0502 economics and businessBenchmark (computing)Artificial intelligence050207 economicsbusinessSentence0105 earth and related environmental sciences
researchProduct